Beyond histograms: why learned structure-preserving descriptors outperform HOG
نویسندگان
چکیده
Statistical image descriptors based on histograms (e.g. SIFT [1], HOG [2]) are widely used in image processing, because they are fast and simple methods with high classification performance. However, they discard the local spatial topology and thus lose discriminative information contained in the image. We discuss the relations between HOG and VNMF descriptors, i.e. structure free histograms versus learned structure-preserving patterns. VNMF is a shift-invariant, sparse, nonnegative unsupervised learning algorithm [8, 9, 5], that provides a distinct decomposition of the input into its parts. The VNMF descriptor outperforms the statistical HOG descriptor, because it preserves spatial topology leading to better classification results on real-world human action recognition benchmarks [11, 12].
منابع مشابه
Histograms of Oriented Gradients for 3D Object Retrieval
3D object retrieval has received much research attention during the last years. To automatically determine the similarity between 3D objects, the global descriptor approach is very popular, and many competing methods for extracting global descriptors have been proposed to date. However, no single descriptor has yet shown to outperform all other descriptors on all retrieval benchmarks or benchma...
متن کامل’Histograms of Oriented Gradients for Human Detection’ versus ’Fast Human Detection Using a Cascade of Histograms of Oriented Gradients’
Dalal and Triggs [1] studied the question of feature sets for robust visual object recognition. They first considered existing edge and gradient based descriptors and then they showed experimentally that grids of Histograms of Oriented Gradients (HoG) descriptors significantly outperform existing feature sets for human detection. After this they studied the influence of each stage of the comput...
متن کاملOne-Shot-Learning Gesture Recognition Using HOG-HOF Features
The purpose of this paper is to describe one-shot-learning gesture recognition systems developed on the ChaLearn Gesture Dataset (ChaLearn). We use RGB and depth images and combine appearance (Histograms of Oriented Gradients) and motion descriptors (Histogram of Optical Flow) for parallel temporal segmentation and recognition. The Quadratic-Chi distance family is used to measure differences be...
متن کاملReal-Time Visual Tracking through Fusion Features
Due to their high-speed, correlation filters for object tracking have begun to receive increasing attention. Traditional object trackers based on correlation filters typically use a single type of feature. In this paper, we attempt to integrate multiple feature types to improve the performance, and we propose a new DD-HOG fusion feature that consists of discriminative descriptors (DDs) and hist...
متن کاملOne-shot-learning Gesture Recognition Using Hog-hof Features Bachelor Thesis One-shot-learning Gesture Recognition Using Hog-hof Features Bachelor Thesis Názov: One-shot-learning Gesture Recognition Using Hog-hof Features
The purpose of this thesis is to describe one-shot-learning gesture recognition systems developed on the ChaLearn Gesture Dataset [3]. We use RGB and depth images and combine appearance (Histograms of Oriented Gradients) and motion descriptors (Histogram of Optical Flow) for parallel temporal segmentation and recognition. The Quadratic-Chi distance family is used to measure differences between ...
متن کامل